Adapting k-means for graph clustering
نویسندگان
چکیده
Abstract We propose two new algorithms for clustering graphs and networks. The first, called K?algorithm , is derived directly from the k -means algorithm. It applies similar iterative local optimization but without need to calculate means. inherits properties of in terms both good capability tendency get stuck at a optimum. second algorithm, M-algorithm gradually improves on results K -algorithm find potentially better optima. repeatedly merges splits random clusters tunes with -algorithm. Both are general sense that they can be used different cost functions. consider conductance function also introduce functions, inverse internal weight mean . According our experiments, M outperforms eight other state-of-the-art methods. perform case study by analyzing disease co-occurrence network, which demonstrate usefulness an important real-life application.
منابع مشابه
Graph based k-means clustering
An original approach to cluster multi-component data sets is proposed that includes an estimation of the number of clusters. Using Prim’s algorithm to construct a minimal spanning tree (MST) we show that, under the assumption that the vertices are approximately distributed according to a spatial homogeneous Poisson process, the number of clusters can be accurately estimated by thresholding the ...
متن کاملAdapting k-means for Clustering in Big Data
Big data if used properly can bring huge benefits to the business, science and humanity. The various properties of big data like volume, velocity, variety, variation and veracity render the existing techniques of data analysis ineffective. Big data analysis needs fusion of techniques for data mining with those of machine learning. The k-means algorithm is one such algorithm which has presence i...
متن کاملK-Means Graph Database Clustering and Matching for Fingerprint Recognition
The graph can contain huge amount of data. It is heavily used for pattern recognition and matching tasks like symbol recognition, information retrieval, data mining etc. In all these applications, the objects or underlying data are represented in the form of graph and graph based matching is performed. The conventional algorithms of graph matching have higher complexity. This is because the mos...
متن کاملPersistent K-Means: Stable Data Clustering Algorithm Based on K-Means Algorithm
Identifying clusters or clustering is an important aspect of data analysis. It is the task of grouping a set of objects in such a way those objects in the same group/cluster are more similar in some sense or another. It is a main task of exploratory data mining, and a common technique for statistical data analysis This paper proposed an improved version of K-Means algorithm, namely Persistent K...
متن کاملBalanced K-Means for Clustering
We present a k-means-based clustering algorithm, which optimizes mean square error, for given cluster sizes. A straightforward application is balanced clustering, where the sizes of each cluster are equal. In k-means assignment phase, the algorithm solves the assignment problem by Hungarian algorithm. This is a novel approach, and makes the assignment phase time complexity O(n), which is faster...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Knowledge and Information Systems
سال: 2021
ISSN: ['0219-3116', '0219-1377']
DOI: https://doi.org/10.1007/s10115-021-01623-y